Optimal Selection of Proportional Bounding Quantifiers in Linguistic Data Summarization

Author

  • Ingo Glöckner
Abstract

Proportional bounding quantifiers such as “between p1 and p2 percent” are potentially useful for expressing linguistic summaries of data. Given p1 and p2, existing methods for data summarization based on fuzzy quantifiers can assign a quality score to the summary. However, it remains open how to establish the optimal choice of p1 and p2 in the range 0 ≤ p1 ≤ p2 ≤ 100%. Moreover, the quality indicators proposed so far are rather heuristic in nature. The paper presents a method for computing the optimal bounding quantifier, that is, the one which best summarizes the given data. Specifically, the method chooses the most specific quantifier that yields the highest validity score of the summary, given a constraint on the percentage range p2 − p1. The method not only assigns validity scores to the quantifiers of interest but also determines the best choice of quantifier in O(N log m) time, where N is the size of the base set and m is the number of different membership grades in the fuzzy arguments.
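To make the selection problem concrete, here is a minimal Python sketch. It is not the algorithm of the paper, whose details the abstract does not give. It assumes a simplified alpha-cut reading of a one-place summary "between p1 and p2 percent of the objects are X": every cut level of the fuzzy set X yields a crisp proportion, and the validity of an interval is taken to be the total weight of the cut levels whose proportion falls inside it. The function name `best_bounding_interval`, the parameter `max_width`, and this validity measure are all assumptions made for illustration. The sketch sorts all N grades, so it runs in O(N log N) time rather than the O(N log m) stated above.

```python
from bisect import bisect_left


def best_bounding_interval(grades, max_width):
    """Return (p1, p2, validity) for fuzzy membership grades in [0, 1].

    Illustrative assumption: validity of [p1, p2] is the total weight of
    alpha-cut levels whose crisp proportion lies in [p1, p2].
    Proportions are fractions in [0, 1], not percentages.
    """
    if not grades:
        raise ValueError("grades must be non-empty")
    if max_width < 0:
        raise ValueError("max_width must be non-negative")

    n = len(grades)
    ordered = sorted(grades)

    # One (proportion, weight) point per distinct positive grade.
    points = []
    prev = 0.0
    for level in sorted({g for g in grades if g > 0.0}):
        count = n - bisect_left(ordered, level)  # elements with grade >= level
        points.append((count / n, level - prev))
        prev = level
    if prev < 1.0:
        # Cut levels above the largest grade give an empty cut (proportion 0).
        points.append((0.0, 1.0 - prev))
    points.sort()

    # Sliding window over the sorted proportions, width at most max_width.
    best = None
    left = 0
    running = 0.0
    for p_hi, weight in points:
        running += weight
        while p_hi - points[left][0] > max_width:
            running -= points[left][1]
            left += 1
        p_lo = points[left][0]
        # Prefer higher validity, then the narrower (more specific) interval.
        candidate = (running, -(p_hi - p_lo))
        if best is None or candidate > best[0]:
            best = (candidate, p_lo, p_hi)
    return best[1], best[2], best[0][0]
```

For example, with grades `[1.0, 1.0, 0.5, 0.0]` the two cut levels give proportions 0.5 and 0.75, each with weight 0.5. Allowing a width of 0.25 covers both levels, while a narrower width can only cover one of them.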

Similar resources

Fuzzy Quantifiers for Data Summarization and their Role in Granular Computing

Data summarization is an enabling technique of Granular Computing, because of its promise to abstract from individual observations and to view a phenomenon as a whole. The linguistic summaries are built around a fuzzy quantifier which functions as the ‘summarizer’. Linguistic data summarization therefore presupposes an underlying model of fuzzy quantifiers, which is of crucial importance to the...

Working Papers of the IJCAI-2013 Workshop on Weighted Logics for Artificial Intelligence

Quantifiers have the ability of summarizing the properties of a class of objects without enumerating them. This talk introduces a framework for modeling quantifiers in natural languages in which each linguistic quantifier is represented by a family of non-additive measures, and the truth value of a quantified proposition is evaluated by using Sugeno’s integral. Some elegant logical properties o...

Protoforms of Linguistic Database Summaries as a Human Consistent Tool for Using Natural Language in Data Mining

We consider linguistic database summaries in the sense of Yager (1982), in an implementable form proposed by Kacprzyk & Yager (2001) and Kacprzyk, Yager & Zadrożny (2000), exemplified by, for a personnel database, “most employees are young and well paid” (with some degree of truth) and their extensions as a very general tool for a human consistent summarization of large data sets. We advocate t...

A Persian Text Summarization System Based on Linguistic Features and Regression

Considering the vast amount of existing written information and the shortage of time, optimal summarization of books, articles, news reports, etc. on the Web is a major concern of researchers. In this paper, we propose a new approach for Persian single-document summarization based on several linguistic features of text. In our approach, after extracting the linguistic features for each sentence,...

An extended intuitionistic fuzzy modified group complex proportional assessment approach

Complex proportional assessment (COPRAS) methodology is one of the well-known multiple criteria group decision-making (MCGDM) frameworks that can focus on proportional and direct dependences of the significance and utility degree of candidates under the presence of mutually conflicting criteria in real-world cases. This study elaborates a new intuitionistic fuzzy modified group complex proportional...

Publication date: 2006